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ABSTRACT 

We present a correlation function analysis for the catalogue of photometric red- 
shifts obtained from the Hubble Deep Field image by Fernandez-Soto et al., 1998. 
By dividing the catalogue into redshift bins of width Az = 0.4 we measured the an- 
gular correlation function w(9) as a function of redshift up to z ~ 4.8. From these 
measurements we derive the trend of the correlation length Tq. We find that ro(z) is 
roughly constant with look-back time up to z ~ 2, and then increases to higher values 
at z > 2.4. We estimate the values of ro, assuming £(r, z) — (r/r (z))~~< , 7 = 1.8 and 
different geometries. For fio = 1 we find r$(z = 3) ~ 7.00 ± 4.87 hr 1 Mpc, in good 
agreement with the values obtained from analysis of the Lyman Break Galaxies. 

Key words: galaxies: clustering - galaxies: general - cosmology: observations - large- 
scale structure 
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1 INTRODUCTION 

The evolution of galaxy clustering provides vital clues to the 
formation of galaxies and large-scale structure. The ampli- 
tude of galaxy clustering is determined by the combination 
of the evolution of the underlying mass fluctuations, and 
the bias relating the galaxy overdensities to mass. Obser- 
vationally the amplitude of the galaxy correlation function 
has been measured from redshift surveys extending up to 
redshifts z ~ 1 (e.g. CFRS, Le Fevre et al, 1995; CNOC2, 
Carlberg et al. 1998). The discovery of Lyman Break Galax- 
ies (Steidel et al., 1996) has allowed the scientific community 
to push this limit even further up to z ~ 3. However there is 
still a gap between the measurements obtained for z < 1 and 
those provided by the analysis of the Lyman Break Galaxies 
at redshifts z — 3. 

The aim of this letter is to "fill in" this gap by pre- 
senting measurements of the correlation function obtained 
from the catalogue of photometric redshifts derived from the 
Hubble Deep Field by Fernandez-Soto et al., 1998. Using the 
photometric redshifts we divide the catalogue into subsam- 
ples in redshift and measure the angular correlation function 
w(8) and its amplitude as a function of z up to z ~ 4.8. At 
low redshift our results agree with the results of Connolly et 
al. (1998), and at higher redshifts they are consistent with 
Lyman Break Galaxies. 

The outline of the paper is as follows: section 2 gives a 
description of the catalogue adopted for our analysis while 
section 3 presents the results for the angular correlation 
function. In section 4 we derive the meaningful spatial quan- 



tities as a function of redshift, while section 5 summarises 
our conclusions. 



2 THE DATA 

The Hubble Deep Field (HDF) image (Williams et al., 1996) 
covers an L-shaped area roughly 3' x 3', with a total area 
~ 4arcmin 2 and provides us with the deepest view of the 
Universe obtained so far. The image was obtained by the 
Hubble Space Telescope using the Wide Field Planetary 
Camera 2 (WFPC2) over a period of 10 consecutive days. 
The same image was acquired by using the broad-band fil- 
ters F300W, F450W, F60W and F814W in order to allow 
for the possibility of assigning photometric redshifts to the 
objects in the field. 

Since then various techniques for getting photometric 
redshifts have been applied to the HDF (see e.g. Connolly 
et al., 1998). The catalogue we will use for this work has been 
derived by Fernandez-Soto et al., 1998 by incorporating in 
their former technique (Lanzetta et al., 1996) the infrared 
images of the HDF acquired in the J(1.2/im), H(1.65/im) 
and K(2.2/im) broad-band filters (Dickinson et al., 1998). 
Their final catalogue includes 1067 objects, some of them 
at very high redshifts (z max ~ 6), as seen in the redshift 
distribution shown in Figure jjj. The edges of the Wide Field 
Camera images are of poorer quality than the bulk of the 
images, and Fernandez-Soto et al. use a magnitude limit of 
AB(8140)=28 for the inner part of the HDF, while the outer 
part of the image includes only objects with AB(8140)<26. 
Using the sensitivity map given in Fernandez-Soto et al., 
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Figure 1. Redshift distribution of the objects in the HDF as 
obtained by Fernandez-Soto et al., 1998 



1998 we rejected all those galaxies belonging to the shallower 
region; our final version of the catalogue, complete down to 
the magnitude AB(8140)=28, includes 946 objects. Figure 1 
shows the distribution of galaxies in the catalogue split into 
a series of narrow redshift intervals. 



3 THE ANGULAR CORRELATION 
FUNCTION 

Correlation-function analysis has become the standard way 
to quantify the clustering of different populations of as- 
tronomical sources. Ideally we would like to measure the 
spatial correlation function, but photometric redshifts are 
not precise enough to enable a direct measurement: the 
typical error between the photometric estimates and the 
spectroscopic measurements of z is Az rrns ~ 0.1 • (1 + z) 
(Az — z sp — Zphot) (see Fernandez-Soto et al., 1998), so that 
for instance Az rma ~ 0.4 at z = 3, corresponding to several 
hundred Mpc. Nevertheless the estimated redshifts can be 
used to select subsamples of galaxies at different redshifts, 
and so we can obtain estimates of the spatial clustering as 
a function of redshift via the angular correlation function. 
As pointed out by Connolly at al (1998), the angular cor- 
relation function from a redshift limited sample has much 
higher signal-to-noise than a comparable magnitude limited 
sample. 

The angular two-point correlation function w(8) gives 
the excess probability, with respect to a random Poisson 
distribution, of finding two sources in the solid angles 8Q,i 
5Q2 separated by an angle 8, and it is defined as 



5P = n 2 SQ 1 SQ 2 [1 + w(6)\ 



(1) 



where n is the mean number density of objects in the cat- 
alogue under consideration. One of the major limitations 
on the study of Large-Scale Structure with the HDF is its 
small field of view; for Qq = 1, 220 arcsecs correspond to 



X 

Figure 2. Spatial distribution of objects in the HDF with 
AB(8140)<28 in different redshifts intervals: 0.4 - 0.8, 0.8 - 1.2, 
1.2 - 1.6, 1.6 - 2.0, 2.0 - 2.4, 2.4 - 2.8, 2.8 -3.2, 3.2 - 4.0 and 
4.0 — 4.8. The resdhift binning is as in Figure H 



0.9/i -1 Mpc at z = 1, (see Connolly et al., 1998), so isolat- 
ing narrow redshift intervals will select galaxies in a small 
volume, and so lead to very large errors in the clustering 
analysis (e.g. a single cluster of galaxies could dominate the 
signal). We therefore decided to divide the sample into bins 
of width Az — 0.4 and consider for our analysis all the ob- 
jects in the "clean" catalogue (see section 2) up to redshifts 
z = 4.8. This redshift bin width corresponds to between 
one and two times the expected rms error in the redshifts. 
Given the small number of galaxies in the catalogue it might 
be thought that the signal-to-noise could be improved by us- 
ing broader bins to increase the number of objects per bin 
and so reduce the Poisson noise. However, increasing the 
redshift range in each bin would lead to a noticeable reduc- 
tion of the clustering signal due to the "washing out" of real 
structures caused by projection effects. As we will see later, 
cosmic variance is not a problem in our analysis. In fact, the 
presence of an overdense (underdense) region in the sample 
would result in a sudden rise (fall) of the clustering ampli- 
tude only in one particular redshift bin. The smooth trend 
of the correlation length ro as a function of z (see Figure 4) 
suggests this is not the case in our analysis. 

We then generated random catalogues containing 10000 
galaxies, with positions of the random objects lying within 
the area defined by the geometry of the photometric data, 
for each of the subsamples associated to a particular red- 
shift bin and then counted the number of distinct data-data 
pairs (DD), data-random pairs (DR), and distinct random- 
random pairs (RR) as a function of angular separation. We 
then calculate w using the estimator (Hamilton, 1993) 



DR' 1 



-1 



(2) 



in the angular scales 9 < 9 < 180 arcsecs. We also used the 
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Figure 3. The angular correlation function w for galaxies within the HDF with AB(8140)<28 at different redshifts intervals. The dashed 
lines show the best fits to the data. 



estimators suggested by Peebles (1980) and Landy & Szalay 
(1993), and found virtually identical results. In Figure |§] we 
show the results for w for different redshift bins; the error 
bars show Poisson estimates for the points. Since the distri- 
bution is clustered, these estimates only provide a lower limit 
to the uncertainties. Nevertheless it can be shown that, over 
the range of scales considered for the calculation of w(0), 
Poisson errors are comparable to those obtained from boot- 
strap resampling (Villumsen et al., 1997, see also Connolly 
et al., 1998). 

Note that in our analysis we did not include the results 
for < z < 0.4; this is due to the fact that the effect of 
excluding bright (nearby) galaxies in the construction of the 
HDF sample results in a spurious reduction of the clustering 
amplitude in that redshift range (Connolly et al., 1998). 

As shown in Figure ^| the angular correlation function 
is significantly positive at small scales for all the redshift 
bins considered in our analysis. The clustering amplitude 
is roughly constant within the errors up to z ~ 2.4; above 



this value the amplitude increases, and remains high for all 
z > 2.4 samples. 

The apparent clustering in the higher redshift samples 
may be partly affected by the problem of defining single 
galaxies from the irregular morphology of some of the ob- 
jects. However, this is a problem only for scales smaller than 
3", and we can see from visual inspection of the distribution 
of sources in Figure |i| that there are several clumps cover- 
ing 10-20 arcsec in each sample; it is these that generate 
the high clustering amplitude. To make sure the signal was 
not spurious, caused for example by difficulties in identifying 
single galaxies from complex irregular galaxies, we visually 
inspected the objects in each of these clumps. The individual 
galaxies appeared well separated and certainly not parts of 
single irregular objects. We conclude that the measurements 
represent real galaxy clustering. 

If we assume a power-law form for w(8) = AO ~ 7 , we 
can estimate the parameters A and 7, using a least-squares 
fit to the data. Given the large errors on w we assumed a 
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Figure 4. Trend of ro as a function of redshift as obtained from the analysis of the HDF photometric redshift catalogue by Fernandez- 
Soto et al., 1998. The left panel shows results for f2o = 1, A = and hg = 1, the panel in the middle shows results for flo = 0.4, A = 
and ho = 0.65 while the right panel is for do = 0.4, A = 0.6 and ho = 0.65. The filled points have been derived from considering redshift 
bins of width Az = 0.4, while the empty ones are for Az = 0.8. 



fixed value of 7 = 1.8. The small area of the HDF catalogue 
introduces a negative bias through the integral constraint, 
J w est dQ, = 0. We allow for this by fitting to AO 1 -"* - C, 
where C = 25^4, Furthermore, even though w(8) has been 
measured up to 9 ~ 0°.05, only angular scales less than half 
the sample size are likely to be reliable (see also Connolly et 
al., 1998), so we limit the fit to 6 ~ 0°.02. The dashed fines 
in Figure ^ represent the best fit for each redshift interval; 
the best values for the amplitude A as a function of redshift 
are listed in Table 1. These values coincide within the errors 
with the results obtained by Connolly et al. in their analysis 
of the clustering in the HDF for z < 1.4. 



4 RELATION TO SPATIAL QUANTITIES 

The standard way of relating the angular two-point correla- 
tion function w(6) to the spatial two-point correlation func- 
tion £(r, z) is by means of the relativistic Limber equation 
(Peebles, 1980): 



w(6) = 2 



Jo°° JT F- 2 (x)x 4 $ 2 (x)ar,z)dx du 
[Jo°° F- 1 (x)x 2 ^(x)dx] 2 



(•3) 



where x is the comoving coordinate, F(x) gives the correc- 
tion for curvature, and the selection function $(:r) satisfies 
the relation 



N - 



poo ^ pc 

I ${x)F- 1 {x)x 2 dx = -- / 
Jo ils Jo 



N{z)dz, 



(4) 



in which N is the mean surface density on a surface of solid 
angle £l s and N(z) is the number of objects in the given 
survey within the shell (z, z + dz). Given the small range of 
angular scales sampled by the HDF we can reasonably as- 



sume f (r, z) to have the redshift dependent power-law form 



r {z) 



(5) 



where all the dependence on z is included in the correla- 
tion scale length ro(z). The physical separation between two 
sources separated by an angle 6 is given (in the small angle 
approximation) by: 



(1 + z) \F 2 



+ x* 



1/2 



(6) 



By including equations (Q), (0) and (H) in equation Q 
we then get the following expression for the angular corre- 
lation function w(9) 

™ H '< JT ^) 2 m>, z) (x(z) 6) 1 -<ro{zyF(z) dz 

W ( )= : r.oo n2 ( 7 ) 



with H~t 



r[l/2]r[( 7 - l)/2]/T[ 7 /2] : 



in the case 



of 7 = 1.8, Ho the Hubble constant and fin is the density 
parameter, and P = dx/dz . 

If we consider a narrow redshift bin Az centred at some 
z we can consider N(z) constant in that interval. Under this 
assumption the expression for the correlation length ro(z) 
in comoving coordinates is given by: 



ro* = 



c Aaz Az 



Ho H-y x{zy-<P(Sl ,z) F(z) 



l/ 7 



(8) 



where A& z is the amplitude of the angular correlation func- 
tion w(0) for a particular redshift interval. Note that in prac- 
tice the error in redshifts (true Az) is the convolution of the 
measured Az with the error distribution. Assuming Gaus- 
sian error distribution, this will increase the effective Az in 
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^0 ( a ) 


'0 (U) 


ro (c) 


0.4 - 


0.8 


0.6 


151 


(0.7 ±0.4) 


10~ 


-3 


1.55 ± 0.85 


2.56 ± 1.42 


3.05 ± 1.68 


0.8 - 


1.2 


1.0 


173 


(0.6 ±0.4) 


10~ 


-3 


1.46 ± 1.08 


2.45 ± 1.81 


3.05 ± 2.26 


1.2 - 


1.6 


1.4 


165 


(1.2 ± 1.0) 


io- 


-3 


2.18 ± 2.06 


3.69 ±3.52 


4.69 ± 4.47 


1.6 - 


2.0 


1.8 


161 


(0.8 ±0.7) 


io- 


-3 


1.78 ± 1.71 


3.09 ±2.95 


3.94 ±3.77 


2.0 - 


2.4 


2.2 


66 


(1.6 ±2.3) 


io- 


-3 


2.60 ±4.18 


4.55 ± 7.31 


5.80 ±9.34 


2.4 - 


2.8 


2.6 


36 


(11 ±4.0) 


10" 


3 


7.58 ± 3.04 


13.9 ±5.36 


17.0 ±6.83 


2.8 - 


3.2 


3.0 


36 


(9.8 ±7.0) 


io- 


-3 


7.00 ± 4.87 


12.4 ±8.66 


15.8 ± 10.9 


3.2 - 


4.0 


3.6 


37 


(5.6 ±5.0) 


io- 


-3 


5.35 ± 4.65 


9.65 ±8.38 


12.2 ± 10.5 


4.0 - 


4.8 


1.1 


39 


(5.9 ±5.0) 


io- 


-3 


5.19 ±5.17 


9.52 ±9.49 


11.8 ± 11.8 



Table 1. Results for each redshift range, Az: mean redshift z; number of galaxies; amplitude of the angular correlation function at 1°, 
A; the spatial clustering amplitude ro for three different cosmologies (a) Clg = 1, A = 0, ho = 1; (b) Qo = 0.4, A = 0, ho = 0.65; 
(c) n = 0.4, A = 0.6, ho = 0.65. 



equation (8) by a factor 



(Az)- 



+ 1, where Az is the bin 



width and a = 0.1(z + 1) is the rms error on each redshift. 

The geometry of space will determine the comoving co- 
ordinate x, the curvature correction factor F(x) and the 
quantity P(f2o, z). In particular, for a Universe with generic 
density parameter Qo and cosmological constant A = (see 
e.g. Magliocchetti et al., 1998; Treyer & Lahav, 1996) we 
have: 



_2c 
Ho 



F(x) = 



and 



P(fio, 



n Q z - (Op - 2)(i - yT+ n z) 



ng(i + 2) 



i_(*5»y (no -i) 



1/2 



(9) 



(10) 



+ z) 2 (l + Q z) 



1/2 



4(f2 - 1)[(1 + fioz) 1 / 2 - 1] + - z) + 2Q z 



-1/2 



dz 



H ° ° Jo [(l + ,)3 + Q--l] 1/2 ' 

198 
nd 

P(Qo, z) = «o /2 [(l + zf + «o 1 - 1] 1/2 - 



the values obtained in the analysis of the clustering of the 
Lyman Break Galaxies (e.g., for Qo = 1, ro = 4 ± 1 - Adel- 
berger et al., 1998 and ro = 2.1 ± 0.5 - Giavalisco et al., 
1998, according to the particular sample used). Given the 
small volume sampled by the HDF data, the uncertainties 
are large, and the exact form of the evolution of ro is not 
well determined. The measurements are consistent with a 
slow decline followed by a smooth rise in amplitude above 
z > 2.4, as predicted by some galaxy formation models. 
From our measurements alone we cannot rule out a simple 
smooth increase of ro with z, but compared to local surveys 
our measurements at z ~ 1 require a significant drop in the 
correlation length between z = and z = 1. 



(11) 



In the case of a cosmological constant A / with f2o±A = 1 
(flat space) we have F(x) — 1, 



(12) 



(see Peebles, 1984; Magliocchetti et al., 1998; Treyer & La- 
hav, 1996) and 



(13) 



Using the values of obtained in section 3, and once 
again using 7 = 1.8 we find the values for the correlation 
length ro listed in Table 1, according to the different cos- 
mologies used in the deprojection analysis. Figure ^ shows 
the trend of ro as a function of the redshift 2 for the three 
cosmological models Qo = 1> A = 0, ho = 1 (left panel), 
flo = 0.4, A = 0, h = 0.65 (central panel) and fl = 0.4, 
A = 0.6, ho — 0.65 (right panel). Given the uncertainties 
in the determination of the photometric redshifts at high 
z's (see section 3), for z < 3.2 we have plotted the mea- 
surements coming from a Az = 0.4 binning (filled circles), 
while for z > 3.2 we only considered broader bins of width 
Az = 0.8 (empty circles). 

The figure shows that ro is roughly constant with look- 
back time for z < 2.4. Above this redshift, the clustering 
amplitude increases by more than a factor of 2. Our mea- 
surements at high redshifts are in excellent agreement with 



5 CONCLUSIONS 

We have calculated the angular correlation function of galax- 
ies in the HDF image as a function of redshift up to z ~ 4.8. 
We use the catalogue of photometric redshifts obtained by 
Fernandez-Soto et al., 1998 to select redshift limited sub- 
samples of width Az = 0.4, and Az = 0.8 between z ~ 0.4 
and z ~ 4.8. The results show that, while for z < 2.4 the 
clustering amplitude is roughly constant, for z > 2.4 there 
is a significant increase in amplitude. 

Converting the projected clustering amplitude into the 
correlation length ro at different redshifts, we find that ro(z) 
is roughly constant as a function of look-back time until 
z ~ 2.4; at higher redshifts the clustering amplitude rises 
to much higher values than for lower z's. Our high-redshift 
measurements are in good agreement with those obtained 
from the analysis of the Lyman Break Galaxies (Adelberger 
et al., 1998; Giavalisco et al., 1998). 

Under the assumption of linear evolution of mass fluctu- 
ations, we would expect a slow decrease in ro towards higher 
redshifts. However, we measure the clustering of galaxies, 
which are biased tracers of the mass. The bias level is un- 
likely to be a constant as a function of redshift; any galaxy 
seen beyond z ~ 2.4 has formed stars at an epoch earlier 
than most galaxies, and so is likely to be biased relative 
to an "average" galaxy. Also, the HDF galaxy sample is 
selected on observed frame / band and so different popula- 
tions of galaxies are selected in the different redshift ranges: 
at z ~ 1 the selection is roughly rest-frame B band, but at 
higher redshift samples are selected on rest-frame UV flux, 
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which will preferentially select the galaxies with higher star- 
formation rates. 

A further complication is caused by the effects of gravi- 
tational lensing. We expect that structure in the foreground 
mass distribution will introduce an extra component of clus- 
tering through the gravitational lensing magnification bias 
(Villumsen et al 1997) . The amplitude of this effect depends 
on the amplitude of mass fluctuations. 

Hence the interpretation of galaxy clustering in the 
HDF at high redshifts is not at all straightforward. Theoret- 
ical models (see for instance Kauffmann et al., 1998; Baugh 
et al., 1998) predict a trend for the evolution of the clustering 
signal with look-back time that is similar to what detected in 
our analysis, but their results are strongly model-dependent. 
We will present a detailed analysis and comparison to mod- 
els in a future paper. 
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